Sequencing and Analysis of Approximately 40 000 Soybean cDNA Clones from a Full-Length-Enriched cDNA Library

Authors

  • Taishi Umezawa
  • Tetsuya Sakurai
  • Yasushi Totoki
  • Atsushi Toyoda
  • Motoaki Seki
  • Atsushi Ishiwata
  • Kenji Akiyama
  • Atsushi Kurotani
  • Takuhiro Yoshida
  • Keiichi Mochida
  • Mie Kasuga
  • Daisuke Todaka
  • Kyonoshin Maruyama
  • Kazuo Nakashima
  • Akiko Enju
  • Saho Mizukado
  • Selina Ahmed
  • Kyoko Yoshiwara
  • Kyuya Harada
  • Yasutaka Tsubokura
  • Masaki Hayashi
  • Shusei Sato
  • Toyoaki Anai
  • Masao Ishimoto
  • Hideyuki Funatsuki
  • Masayoshi Teraishi
  • Mitsuru Osaki
  • Takuro Shinano
  • Ryo Akashi
  • Yoshiyuki Sakaki
  • Kazuko Yamaguchi-Shinozaki
  • Kazuo Shinozaki
Abstract

A large collection of full-length cDNAs is essential for the correct annotation of genomic sequences and for the functional analysis of genes and their products. We obtained a total of 39,936 soybean cDNA clones (GMFL01 and GMFL02 clone sets) from a full-length-enriched cDNA library constructed from soybean plants grown under various developmental and environmental conditions. Sequencing from the 5' and 3' ends of the clones generated 68,661 expressed sequence tags (ESTs). The EST sequences were clustered into 22,674 scaffolds, including 2,580 full-length sequences. In addition, we sequenced 4,712 full-length cDNAs. After removing overlaps, we obtained 6,570 new full-length soybean cDNA sequences. Our data indicate that 87.7% of the soybean cDNA clones contain complete coding sequences in addition to 5'- and 3'-untranslated regions. Together, these data confirm that our collection of soybean full-length cDNAs covers a wide variety of genes. Comparative analysis of the derived soybean sequences against data from Arabidopsis, rice and other legumes revealed that our collection includes some specific genes, a large proportion of which could be annotated only as having unknown function. The large set of soybean full-length cDNA clones reported in this study will serve as a useful resource for gene discovery in soybean and will aid precise annotation of the soybean genome.

Related resources

FULL-malaria: a database for a full-length enriched cDNA library from human malaria parasite, Plasmodium falciparum

FULL-malaria is a database for a full-length-enriched cDNA library from the human malaria parasite Plasmodium falciparum (http://133.11.149.55/). Because of its medical importance, this organism is the first target for genome sequencing of a eukaryotic pathogen; the sequences of two of its 14 chromosomes have already been determined. However, for the full exploitation of this rapidly accumulat...

Identification of cDNA clones encoding valosin-containing protein and other plant plasma membrane-associated proteins by a general immunoscreening strategy.

An approach was developed for the isolation and characterization of soybean plasma membrane-associated proteins by immunoscreening of a cDNA expression library. An antiserum was raised against purified plasma membrane vesicles. In a differential screening of approximately 500,000 plaque-forming units with the anti-(plasma membrane) serum and DNA probes derived from highly abundant clones isolat...

Cost-Effective Sequencing of Full-Length cDNA Clones Powered by a De Novo-Reference Hybrid Assembly

BACKGROUND Sequencing full-length cDNA clones is important to determine gene structures including alternative splice forms, and provides valuable resources for experimental analyses to reveal the biological functions of coded proteins. However, previous approaches for sequencing cDNA clones were expensive or time-consuming, and therefore, a fast and efficient sequencing approach was demanded. ...

Assessment of Redundancy and Full-Length Rate of Full-Length Enriched cDNA Libraries

Collection of full-length genes requires libraries with full-length cDNA insert, large-scale sequencing, library assessment, and high-speed sequence clustering. Here we focus on computational methods, such as newly developed computer programs, since our experimental methods had been published previously. Our purpose is the collection of full-length cDNAs, therefore the proportion of full-length...

PEDE (Pig EST Data Explorer) has been expanded into Pig Expression Data Explorer, including 10 147 porcine full-length cDNA sequences

We formerly released the porcine expressed sequence tag (EST) database Pig EST Data Explorer (PEDE; http://pede.dna.affrc.go.jp/), which comprised 68,076 high-quality ESTs obtained by using full-length-enriched cDNA libraries derived from seven tissues. We have added eight tissues and cell types to the EST analysis and have integrated 94,555 additional high-quality ESTs into the database. We al...

Journal title:
  • DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes

Volume 15

Publication date: 2008